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Abstract 

Let G be any connected graph, and let Ai and p denote the spectral 
radius of G and the universal cover of G, respectively. In [LP09] . Linial 
and Puder have shown that almost every n-lift of G has all of its new 
eigenvalues bounded by 0(\\^ 3 p 2 ^ 3 ). Friedman had conjectured that 
this bound can be improved to p + o n (l) (e.g., see |Fri03l IHLW 06]). 

In |LP09] , Linial and Puder have formulated two new categorizations 
of formal words, namely <j) and (3, which assigns a non-negative integer or 
infinity to each word. They have shown that for every word w, <p{w) = 
iff f3(w) = 0, and <p(w) = 1 iff f3(w) = 1. They have conjectured that 
4>(w) = (5{w) for every word w, and they have run extensive numerical 
simulations that suggest that this conjecture is true. This conjecture, if 
proven true, gives us a promising approach to proving a slightly weaker 
version of Friedman's conjecture, namely the bound O(p) (see [LP09j), 

In this paper, we show that (ft(w) = 2 iff (3(w) = 2 for every word 
w. We also discuss possible strategies for proving (f>(w) = 3 iff (3(w) = 3. 

Keywords: graph eigenvalues, random graph lifts 

1 Introduction 

Let G be any connected graph with oriented edges. An n-lift of G is any 
graph that has an n-fold covering map onto G. Equivalently, an n-lift of G is 
any graph H with vertices V(H) — V x {1, ...,n}, and whose edges E(H) can 
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be obtained in the following manner: for every oriented edge (u, v) G E(G), 
we choose any permutation <T( u ,v) S n and add an (undirected) edge between 
(u, i) and (v, <J( u ,v)(i)) for i = 1, . . . , n. 

Now, let E{G) = {gi, . . . , gk}. Every fc-tuple of permutations in S n de- 
scribes an ra-lift of G. The random graph model we consider is the probability 
space L n (G) of n-lifts of G, with sample space S n k and uniform probability 
distribution. We note that when G is a single vertex with d/2 self-loops (with 
d even), this random graph model is the same as the "permutation model" 
for random <i-regular graphs. For background on lifts and random lifts, see 
[LR051 KL061 IALM021 IHEW06| . 

Our main interest is to study the eigenvalues of (the adjacency matrix 
of) random lifts of graphs. Let H be any n-lift of G. The projection 7r : 
V(H) — > V(G) defined by ir(u,i) = u is the natural covering map from H to 
G. It can be easily verified that if / is an eigenfunction of G, then / o tt is an 
eigenfunction of H with the same eigenvalue as /. The |V(Cr)| eigenvalues of 
H corresponding to the |V(G)| such eigenfunctions are said to be old, while 
the remaining n|V(Cr)| — |V(Cr)| eigenvalues of H are said to be new (note 
that a new eigenvalue can have the same value as an old eigenvalue). 

Let Ai and p denote the spectral radius of G and the universal cover of G, 
respectively. For H G L n (G), let p max (H) = max{|/i| : p is a new eigenvalue of 
H}. In |Fri03], Friedman showed that almost every n-lift H G L n (G) satisfies 
Pmax{H) < A^p 1 / 2 + o n (l). In [LP09j . Linial and Puder improved this bound 

to p m ax(H) < max 1 1,3 j ' A^ 3 p 2 / 3 + o n (l) for almost every n-lift 

H G L n (G). 

For the special case where G is a single vertex with d/2 self- loops (i.e., for 
the permutation model of random <i-regular graphs, with d even), p ma x(H) 
corresponds to X(H) := max{|A 2 |, |A n |}, where Ai > A 2 > • • ■ > A n are the 
eigenvalues of H. X(H) has been well-studied in the literature since it gives 
various information about H, such as the expansion properties of H and the 
rate of convergence of the random walk on H to the stationary distribution 
(see |HLW06j ). 

For the permutation model, Friedman's result in [Fri03j states that almost 
every random n-vertex ci-regular graph H satisfies \(H) < \j 2dy/cT^T+o n (l), 
which is a slight improvement of the result of Broder and Shamir in [BS87j . 
Linial and Puder's result in |LP09] states that almost every random n-vertex 
c?-regular graph H satisfies X(H) < 0(d 2 ^ 3 ), and more specifically, X(H) < 
(4d(d - 1)) 1/3 + o n {\) for d > 107. 

For various models of random <i-regular graphs (including this specific per- 
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mutation model), Friedman had shown that almost every random n- vertex d- 
regular graph H with d > 3 satisfies \(H) < 2y 'd — l + o n (l) (see [Fri08j ) . The 
Alon-Boppana bound (see [NiMl IFriD3] ) shows that X(H) > 2 v / cT^ r T- o n (l) 
for every n-vertex <i-regular graph H, so Friedman's result cannot be improved 
significantly, if at all. 

The results of [BS87j . [Pn03] . and jLP09] all use the Trace Method, which 
involves estimating the expected value of the trace of a high power of the ad- 
jacency matrix of a random graph. To estimate this expected value, Linial 
and Puder (in [LP09j ) study word maps associated with formal words over 
the alphabet E = = {<7i , ...,<7fc }■ Given a word w G £*, the word 
map associated with w maps the /c-tuple (<7i, . . . , cr^) G 5 n to the permuta- 
tion w(o~i, . . . , crfc) G 5* n , where w(eri, . . . , cr fe ) is the permutation obtained by 
replacing g 1: . . . ,g k with ci, . . . , cr*. (respectively) in the expression for w. 

The results of |BS87j . [Fri03j . and |LP09] all involve studying the probabil- 
ity that 1 (or any given point in {1, . . . , n}) is a fixed point of the permutation 
w(o~i, . . . , crfc), when <j\, . . . , <7fc G S n are chosen randomly with uniform distri- 
bution. We are interested in how close this probability is to -, which depends 
on the word w. 

To study this probability, Linial and Puder (in [L P09j ) formulated two 
new and separate categorizations of formal words, namely 0, (3 : S* — > Z>o U 
{oo}, which are invariant under reduction of words. Thus, <fi and f3 are also 
categorizations of the words in the free group F = generated by {gi, . . . , g k }. 
Intuitively, <j>(w) measures how close the above probability is to - for the word 
w (the higher 4>{w) is, the closer the probability is to -). On the other hand, 
P(w) is defined combinatorially without explicit reference to word maps and 
the symmetric group. Both <p and (3 extend the dichotomy of primitive vs. 
imprimitive words in F (recall that w G F is said to be imprimitive (as an 
element of F) if w — u d for some mgF and d > 2). In some sense, 4>(w) and 
P(w) can be thought of as quantifying the "level of primitivity" of the word 
w. 

In |LP09j . Linial and Puder have conjectured that 4>{w) = (3(w) for every 
word w. They have proven that <f>(w) = iff (3(w) — iff w reduces to the 
empty word, and <f>(w) = 1 iff /3(w) = 1 iff w is imprimitive as an element of 
F. These two facts also appear in [BS87] and [Fri03], but not in the explicit 
language of cj) and (3. Linial and Puder have also made partial progress towards 
proving <j)(w) = 2 iff j3(w) = 2, which allowed them to obtain a slightly better 
eigenvalue bound compared to the result in [Fri03] . Furthermore, they have 
run extensive numerical simulations, and the results suggest that 4>(w) = f3{w) 
for every word w. 

Friedman had conjectured that almost every n-lift H G L n {G) of G satisfies 
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Hmax(H) < P + o n (l) (e.g., see |Fri03l |HLW06| ). It is known that every n-lift 
H G L n {G) of G satisfies /w(#) > p-o„(l) (see [Gre95l lFntMlHLW06| ). so 
one cannot prove a significantly stronger result. The conjecture that (f)(w) = 
/3(w), if proven true, gives us a promising approach to proving a slightly weaker 
version of Friedman's conjecture, namely fi max (H) < O(p) (for almost every 
n-lift H G L n (G)) (see [LP09J). Also, if proven true, the conjecture may also 
significantly simplify the usage of the Trace Method in proving new (or old) 
eigenvalue bounds in various contexts. 

The 4>{w) = (3{w) conjecture is also interesting in other aspects, such as 
its connection to word maps. For example, a slightly stronger version of this 
conjecture made in [LP09j implies that for every word w and sufficiently large 
n, w(<Ji, . . . , <7fc) has at least one fixed point on average. 

Our work mainly builds on the work of Linial and Puder in [LP09] . In 
particular, we show that <f)(w) = 2 iff (3{w) = 2 for every word w G £*. For 
those already familiar with the paper [LP09j . we briefly mention some aspects 
of our proof. For a word w G X* such that /3(w) > 2, recall that there exists 
a natural surjective function from the (connected) components of some graph 
T to the quotients in Q w that have characteristic 2 and type A (see |LP09j ). 
Linial and Puder believed that this function is also injective, which would 
show that 4>{w) = 2 iff (5{w) = 2. In this paper, we follow this strategy and 
prove that this function is actually injective. This is done by investigating 
the relationship between the components of T and proving a certain property 
about them. The main technique we use a recursive factoring process that 
decomposes the word w into finer and finer pieces, which allows us to analyze 
the components of T. 

In this paper, we also discuss possible strategies for proving <j)(w) = 3 iff 
f3(w) = 3. Although our result does not give us a new eigenvalue bound, 
the strategy and techniques we use may be useful in proving the 4>(w) = f3(w) 
conjecture, which leads to a promising approach to proving the slightly weaker 
version of Friedman's conjecture. Our result is also interesting in the context 
of word maps. For example, the result tells us more about the probability that 

1 is a fixed point of the permutation w(ai, . . . , a*.) (when a±, . . . , G S n are 
chosen randomly with uniform distribution), and how much this probability 
differs from -. 

n 

2 Review of (f> and (3 and Related Concepts 

In this section, we review some concepts and terminology from [LP09j . as well 
as introduce some new terminology for convenience. 
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Fix k > 1, and let E = E fe = {g^ 1 , . . . , g^ 1 } be an alphabet of 2k ab- 
stract letters. E* denotes the free monoid of words generated by E, and the 
quotient of E* modulo reduction of words is the free group F = gener- 
ated by {gi, . . . ,gk}- For every word w G E* and n G Z + , we have a ran- 
dom variable on S% (with the uniform probability distribution); specif- 
ically, Xw\ci, . . . , <7fc) is the number of fixed points of w(ai, . . . , a k ), where 
w(a 1 , . . . , <7fc) is the permutation in S n obtained by replacing g x ,...,g k by 
(Ti,...,(7fc, respectively, in the expression for w. Then, & w (n) is defined as 

®w( n ) = E( - Xw n • > ~ 1 = E ^ Xw ^ — -. E( - X ™ - is the probability that 1 (or any given 
point in {1, . . . , n}) is a fixed point of the permutation w(ai, . . . , a k ), when 
01, . . . , <7fc G are chosen randomly with uniform distribution. Thus, <£v(n) 
measures how much this probability differs from ^ for the word w. 

It can be shown that for every word w G E*, & w can be expressed as a 
power series in -; specifically, for every word w G E* and n > \w\, we have 

&w(n) = Et " Xw n - 1 = Xl^o a «( u ')^') where the coefficients a,i(w) are integers 
depending only on w. For any word w G E*, </>(u>) is defined to be the least 
non-negative integer % such that cii(w) ^ 0, or oo if no such i exists (i.e., 
if E(xi™^) = 1 for n > \w\). Thus, <f>(w) measures how much the above 
probability differs from ^ for the word w. The higher (f>(w) is, the closer the 
probability is to -. 

We now describe some terminology needed to define /3(w) for any word w G 
E*. Fix any word w = g^ g£i gi\^\ G E*, where i\, i 2 , . . . , i\ w \ G {1,2, ... ,k} 
and cci, a 2 , . . . , cx\ w \ G { — 1, 1}, and consider the following directed edge-labeled 
graph: 

«i c 2 "3 "IH-i „"IH 

r -> ffil r -> 9i 2 r -> 9*3 Sl M-l r -> 9l M r -> 
{So} ► {Si} > {S 2 \ >■■• > {S\w\-l\ ► {S\ w \\ 

For convenience, we shall change the representation of the edges of the graph 
so that all the labels are integers in {1, . . . , k}. Specifically, every edge with 
label gi a is relabeled as i, and if a — — 1, the orientation of the edge is reversed 
also. We shall call the resulting graph the open trail of w, and we denote it 
by T w . E.g., for the word w = g\g 2 g 2 g 2 ~ x g^g-ig^ 1 , the open trail T w of w is the 
following directed edge-labeled graph: 

{ s o} {si} {s 2 } {s 3 } <— {s 4 } {S5} {s 6 } ^— {s 7 } 

Fix a word w G E*. A quotient of T w is defined as any directed edge- 
labeled graph whose vertex set is a partition of {sq, . . . , s\ w \}, and there is 
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a j-labeled edge (j-edge for short) from vertex U to vertex V iff there exist 
U', V G V(T W ) such that U' C U, V C V, and T w contains a j-edge from [/' to 
V. Equivalently, a quotient of T m is a graph obtained by "joining" vertices of 
T w together in any combination; when two or more vertices are joined together, 
the resulting vertex is the union of the sets representing the original vertices, 
and any redundant multiple edges that have the same label and orientation 
are removed from the resulting graph. All the directed edge-labeled graphs in 
this paper do not allow two distinct j-edges to have the same head vertex and 
the same tail vertex. 

Given a quotient T of T w , we can also define a quotient of T in a similar 
manner. Specifically, a quotient of T is any directed edge-labeled graph T' 
whose vertex set V(T') is a partition of {s , . . . , s\ w \} that is coarser than 
V(r); also, there is a j-edge in V from vertex U' to vertex V iff there exist 
U, V G V(r) such that U C U', V C V', and T contains a j-edge from U to 
V . Again, equivalently, a quotient of Y is a graph obtained by joining vertices 
of r together in any combination. 

We note that a quotient of a quotient of V is a quotient of V . We also note 
that there are natural bijections between the following sets: {quotients of T}, 
{partitions of {so, . . . , s\ w \} coarser than \^(r)}, and {partitions of V(r)}. 

A walk in a quotient of T w traces out a word in E*. When a j-edge is 
traversed with the same (respectively, opposite) orientation as the edge, the 
letter gj (respectively, gj 1 ) is traced out. We note that the word w is traced 
out by the walk that visits the vertices containing sq, . . . , s\ w \, respectively, in 
this order, and choosing the correct edge to traverse at each step according to 
w. 

Let T be any quotient of T w . Given s p , s q G {so, . . . , s\ w \}, we write s p =r s q 
if s p and s q are in the same vertex of T, and s p s q otherwise. Also, given 
U, V C {so, • . • , s\ w \}, we write U =r V if U C W and V C W for some vertex 
W in T, and U V otherwise. We may omit the subscript Y if the quotient 
being considered is clear from context. 

Let T be any quotient of T w , and let S = {{sj 1 , s^}, ■ ■ ■ , {sj r , Sk r }} be 
any set whose elements are two element subsets of {so, • • • , s\w\}- The quo- 
tient ofT by S, denoted T/S, is the quotient whose vertex set (a partition of 
{s , • • • , s\ w \}) is defined by the following condition: s p , s q G {s , . . . , s\ w \} are 
placed in the same block of the partition iff s p = s q holds in every quotient of 
T that satisfies Sj i = for alH = 1, . . . , r. One can easily verify that this 
condition defines a valid partition of {so, ■ ■ ■ , s\ w \} that gives a quotient of V. 
For convenience, we write T/{sj, sj,} for F/{{sj, s^}}. 

Intuitively, for Sj,Sk G {s , . . . , s\ w \} with Sj ^ s k , T/{sj,s k } is the quo- 
tient of T obtained by joining the two vertices of T that contain Sj and s k , 
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respectively. F/S is the quotient of F obtained by performing this joining op- 
eration for each element {sj v s^} in S; we easily see that the order in which 
the joining operations are performed does not matter. If S is a set of two 
element subsets of V(F), or if S is a partition of V(F), we define F/S in a 
similar manner. 

A quotient of T w is said to be realizable if for every j G {1, . . . , k}, the 
quotient does not contain a pair of distinct j-edges that have the same head 
vertex or the same tail vertex. Let F be any quotient of T w . The realization 
of T, denoted rel(F), is the quotient of F whose vertex set is defined by the 
following condition: s p , s q G {s , . . . , s\ w \} are placed in the same vertex iff s p = 
s q in every realizable quotient of F. One can easily verify that this condition 
defines a valid partition of {sq, . . . , s\ w \} that gives a realizable quotient of F. 

Intuitively, rel(F) is the quotient of F obtained by repeatedly joining any 
two vertices that "cause the current quotient to be not realizable", until we 
get a realizable quotient. If a quotient contains two distinct j-edges that have 
the same head (respectively, tail) vertex, then the two tail (respectively, head) 
vertices "cause the quotient to be not realizable" . We easily see that we get the 
same quotient no matter how we choose exactly which pair of "nonrealizability- 
causing" vertices to join at each step of the process. It can also be verified that 
rel(F/S) can be obtained from F by repeatedly joining pairs of vertices that 
either are specified by S or are causing nonrealizability, in any order, until the 
quotient is realizable and Sj = s& for each {sj, s&} G S. 

Given any quotient F of T w , we define the (Euler) characteristic of F, 
denoted x(H, to be x( r ) = \E(T)\ - \V(F)\ + 1. Let S be any set of two 
element subsets of {so? • • • , s\w\}- The quotient of T w generated by S is the 
quotient rel(T w /S). By definition, the quotient generated by S is realizable. 
Conversely, every realizable quotient F of T w is generated by at least one set of 
two element subsets of {so, . . . , s\ w \\, and any such set is called a generating set 
for T. It can be shown that for any realizable quotient F of T w , any minimum 
generating set (in size) for F contains exactly x(F) elements. 

The collection of all realizable quotients of T w such that s = s\ w \ is denoted 
by Q w . We now have the necessary tools to categorize the quotients in Q w 
and define (3(w). We say that a quotient F G Q w has type A if one of the 
smallest generating sets for F contains the pair {so? s h}; otherwise, we say 
that T has type B. Now, we define (3(w) to be the smallest characteristic of a 
type-B quotient in Q w , or oo if no type-B quotient exists. 

Earlier, we defined <& w (n) as <& w (n) = E( " Xw n ~ > ~ 1 . However, for n > \w\, it 
can be shown that there is an alternative expression for & w (n) in terms of the 
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quotients in Q w , namely 



= — + V 



n ' 



where t>r and ef denote the number of vertices and the number of j-edges in 
T, respectively. 

Let -P(r) denote the big expression inside the sum in (JT]). Then, & w (n) 
can be written more concisely as § w (n) = — ~ + J2 reQm P(T). P(T) is the 
probability that the walk starting at the point 1 and moving according to 
the permutation w(a%, . . . , a^) traces out the quotient Y and visits points in 
the same "coincidence" pattern as the walk in V that traces out w and visits 
the vertices containing so, . . . , s\ w \, respectively, in this order. We note that 
P(r) is fully determined by vr and ef , j = 1, . . . , k, and does not depend on 
the word w. However, Q w does depend on w and is the set of quotients that 

are summed over in (PQ). We note that Er G g„ P ( r ) = £d ^ ) which is the 
probability that 1 is a fixed point of the permutation w(<Ji, . . . , o^). 

The power series for $^ comes from the expression in ([1]). Indeed, we can 
rewrite ([1]) as 



® w (n) -- 

n c — ' \nj -*■-"■ \ n/-"--"-- , -- , -\ n 

r&Q w v 7 i=i v 7 j=i i=i v 

and then further simplify the expression to obtain a power series for & w . 

We note that both 4>{w) and (3{w) are fully determined by the quotients in 
Q w . Thus, we can study both of these parameters by studying Q w . In [LP09J, 
Linial and Puder have conjectured that <j)(w) = f3(w) for every word w G E*. 
They have shown that for every word w 6 S*, 4>{w) = iff (3(w) — iff w 
reduces to the empty word, and 4>{w) = 1 iff (3{w) — 1 iff w is imprimitive as 
an element of F (i.e., w = u d for some u € F and c? > 2). They have also 
made partial progress on showing that <p(w) = 2 iff /3(w) = 2 for every word 
w G £*. 

It is known that both and /3 are invariant under reductions, cyclic 
shifts, cyclic reductions, and automorphisms of F (i.e. 4>(f(w)) = 4>{w) and 
f3(f{w)) = (3{w) for any word w G S* and any group automorphism / of F; 
4>(f(w)) and (3{f{w)) are well-defined because and /? are invariant under 
reductions). 

The quotients in Q w are the realizable quotients of a special graph in Q w , 
namely rel(T w / {so, S| w |}), which we call the universal graph ofw and denote by 
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T w . Recall that any minimum generating set for a realizable quotient T of T w 
contains exactly x(T) elements. Thus, T w is the only quotient in Q w that can 
possibly have characteristic 0, and this occurs iff T w := rel(T w / {s , s\ w \}) = 
rel(T w ), which occurs iff has type B; otherwise, T w has type A and char- 
acteristic 1. T w is also the only quotient in Q w that can have characteristic 1 
and type A; all other quotients in Q w with characteristic 1 have type B. 

3 The Identification Graph of a Realizable Quo- 
tient 

In this section, we define the identification graph of a realizable quotient, which 
has already been introduced in |LP09] . However, we discuss the identification 
graph further and introduce some new terminology. The concepts and termi- 
nology here will be used in our proof of <p{w) = 2 iff (3{w) = 2 for every word 
w g £*. 

Fix w G E*, and let T be any realizable quotient of T w . Suppose that we 
join two vertices of T to obtain another quotient T'. V may not be realizable, 
so additional pairs of vertices may need to be joined in order to obtain the 
realizable quotient rel(T'). Thus, joining two vertices of T may imply further 
joins required in order to maintain realizability. 

To study exactly which vertices of T become joined, we define a new di- 
rected edge-labeled graph called the identification graph of T, denoted Ir- The 
vertices of Ir are pairs of distinct vertices of T, and there is a j-edge from ver- 
tex {U, V} to vertex {[/', V'} iff T contains a j-edge from U to some vertex 
in {U',V}, and a j-edge from V to the other vertex in {U',V} (since T is 
realizable, these two j-edges cannot point to the same vertex in {U ; , V'}). We 
note that for every pair {e, e'} of distinct j-edges in T, there is a j-edge in 
I r from {tail(e), tail(e')} to {head(e) , head(e')} (where head(e) refers to the 
vertex pointed to by the directed edge e). These are all the edges of Ir, so 

Ir contains exactly Ylj=i (if) edges. We also note that I r does not contain a 
pair of distinct j-edges that have the same head vertex or the same tail vertex, 
since T is realizable. For convenience, we shall write I w for Jr w . 

Given any directed edge-labeled graph G and a vertex v G G, let d 3 Q (v) = 
\{e G E(G) : tail(e) = v}\, d£(v) = \{e G E(G) : head(e) = v}\, and d j G {v) = 
d 3 Q{v) + d 3 (7(v). Subscripts are omitted when the graph under consideration 
is clear from context. 

Proposition 1. Let w G S* ; and let T be any realizable quotient ofT w . Then, 
for every {U, V} G Ir, we have d({U, V}) < mm{d(U) , d(V)} . In particular, 
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we have S(Ir) < S(T) (provided that V(Ir-) nonempty) . 

Proof. Let G J r . WLOG, suppose that d(C/) = min{d(U), d(V)}. 

Let j G {1,...,*}. It suffices to show that d j+ ({U, V}) < d j+ (U) and 
d?~({U, V}) < dP~(U). Since Jr does not contain a pair of distinct j-edges that 
have the same head vertex or the same tail vertex, we have di + ({U, V}) < 1 
and d j ~({U, V}) < 1. Now we note that if d j+ ({U, V}) = 1, then by the con- 
struction of Ir, there exists a j-edge in T from U to some vertex, so <# + ([7) > 1. 
Similarly, if d j '({U, V}) = 1, then d j ~{U) > 1. □ 

Let w G £*. Since both and /? are invariant under cyclic reductions, we 
can assume that w is cyclically reduced. Furthermore, we already know exactly 
when 4>{w) = 0, (3{w) = 0, <f>(w) = 1, and (3{w) = 1. Thus, in our investigation 
of and /3(w), we can also assume that <f>(w) > 2 and (3{w) > 2. We will 
often be making these convenient assumptions. 

Proposition 2. Let w £ Y>* be a cyclically reduced word such that <p(w) > 
2 (and hence /3(w) > 2). Then, I w is a disjoint union of paths (including 
isolated vertices) with some orientation of its edges (not necessarily the same 
orientation within each path). In particular, I w contains no cycles (with any 
orientation of each of the cycle's edges). 

Proof. Since w is cyclically reduced and is not the empty word (since <f)(w) > 
2), T w is a cycle. Thus, d~(T w ) = 2, so S(I W ) < 2 by Proposition [TJ Thus, I w is 
a disjoint union of paths and cycles. It has already been shown in [LP09j that 
I w contains no cycles. □ 

Let w G £*, and let T be a realizable quotient of T w . Using J r , we can see 
what additional vertices are joined when pairs of vertices are joined in T and 
then a realization operation is performed. For example, suppose that distinct 
vertices v\, vi are joined in T, and then a realization operation is performed to 
obtain the quotient rel(T/{vi,v 2 })- Let C be the component of Ir containing 
the vertex {t>i,t>2}. By the construction of Ir, all the pairs of vertices of T in 
the component C also become joined during the realization operation. It is 
clear that joining any of the pairs of vertices of T in C would yield the same 
quotient after the realization operation is performed. Not only do the pairs in 
C become joined, but the pairs in other components of I-p may become joined 
as well. For example, suppose that {t>2, ^3} is also in the component C. Then, 
if Vi and v-i are joined in T, v 2 and W3 also become joined. By transitivity, these 
two joins automatically imply that Vi and v 3 are joined. However, {vi, v 3 } may 
not be in the component C, but instead may be in the component C . Thus, 
all the pairs in C would be joined as well. 



10 



To determine exactly which pairs of vertices of T become joined, we start 
with the set S consisting of all the vertices in C. Then, if S contains {vi,Vj} 
and {vj, Vk} with Vi 7^ Vk but does not contain {y j, Vk}, we take all the vertices 
in the component of Ir that contains {vi,Vk}, and we add these vertices to 
S. We repeat the previous step until no more vertices need to be added to S. 
We note that S naturally defines a symmetric relation on V(r), and we can 
insist that the relation defined is also reflexive. By our construction of S, the 
relation defined by S is also transitive, so S defines an equivalence relation on 
V(r). We note that S also induces a set of components of Ir- We see that 
S describes exactly which pairs of vertices of T become joined; i.e., the set S 
contains precisely the pairs {v, v'} G V(Ir) for which v = v' in rel(T/ {vi, v 2 })- 

We note that if T is a set of vertices of Ir that defines a reflexive, symmetric 
relation on V(T) that is also transitive (i.e., an equivalence relation), and T 
induces a set of components of It, then T naturally defines a realizable quotient 
of T. The equivalence relation defined by T gives a valid quotient of T, and the 
condition that T induces a set of components of Ir ensures that the quotient 
is realizable. 

Let Y denote the set of components of I-p induced by S. We see that 
Y is the (unique) smallest set of components of Ir that contains C and is 
closed under transitivity of pairs, i.e., if the collection of components contains 
the pairs {vi,Vj} and {vj,Vk} with Vi 7^ Vk, then the set of components also 
contains the pair {vi, Vk}- (In other words, a collection of components is closed 
under transitivity of pairs if the reflexive, symmetric relation on V(r) defined 
by the set of vertices in the collection of components is transitive.) We easily 
see that if Z is another set of components of 7 r that contains C and is closed 
under transitivity of pairs, then Y C Z. 

We say that a component C of Jr implies component C if for every (equiv- 
alently, some) pair {U,V} G C, we have U = V in rel(T/V(C)). Intuitively, 
C implies C if joining a pair of vertices of V in C implies that each of the 
pairs of vertices of Y in C also become joined when a realization operation 
is performed. From our discussion above, we see that the set of components 
implied by a component C is the smallest set of components that contains C 
and is closed under transitivity of pairs. Furthermore, the set of components 
implied by C is contained in any such set of components. 

4 Proof of cf)(w) = 2 iff (3(w) = 2 

The main result of this section is that 4>(w) = 2 iff f3(w) = 2 for every word 
w G £*. In the proof, we will be factoring the word w recursively into finer 
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and finer pieces. Lemma [2] below helps us perform this factoring process, while 
Lemma [T] is used to prove Lemma [2J We first prove Lemma [TJ which is related 
to factoring certain words into a particular form. Let e denote the empty word. 

Lemma 1. Let x,y, z G X* with xy = yz and x 7^ e. Then x = uv, y = (uv) p u, 
and z = vu for some u, v G X* and p > 0. 

Proof. We prove the lemma by induction on \y\. If \y\ — (i.e., y = e), then 
x = z, so we let u = a, v = x, and p = 0, as required. Now, suppose that 

M>i- 

If \y\ < \x\, then since xy = yz, y is a prefix of x. Thus, 2 = yx' for some 
x' G X*. Since = yz, we have yx'?/ = yz, so x'y = 2. Now, we let u = y, 
v = x', and p = 0, as required. 

If |y| > |x|, then since xy = yz, x is a prefix of y. Thus, y = xy' for some 
y' G X*. Since x 7^ e, we have < xy = yz implies that xxy' = xy'z, so 
xy' = y'z. By the induction hypothesis, we have x = uv, y' = (uv) p , z = vu 
for some u,v G X* and p > 0. We also have y = xy' = uv{uv) p = (uv) p+1 , as 
required. □ 

We now prove Lemma [2], which allows us to factor a pair of words with 
differing last letter into one of several different forms. Let S + = X* \ {e}, and 
for x G X + , let x\ x \ denote the last letter of x. 

Lemma 2. Let x, y G X + such that x\ x \ 7^ y\ y \. Then, x and y can be factored 
into one of the following forms: 

(1) x = uv and y = (uv) p ub for some u,v,b G X + and p > such that b\ 7^ v\ 

and b\ b \ 7^ v\ v \. 

(2) x = u and y = u p b for some u, b G X + and p > such that bi 7^ U\ and 

b\b\ 7^ 

(3) x = uv and y = [uv) p u for some u, v G X + and p > suc/i i/iat U| w | 7^ v\ v \ . 

(4) x = u and y = u p for some u G X + and p > 1. 

Proof. Let e be the longest common prefix of xy and y. Then, y = eb for some 
6 G X*, so xy = xeb. We can also write xy = ecb for some c G X + . Thus, 
xe6 = ecb, so xe = ec. By Lemma [Q, we have x = m>, e = {uv) p u, and c = vu 
for some it, w G X* and p > 0. Thus, y = eb = {uv) p ub. Since x 7^ e, we cannot 
have m = i> = e. 
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If u 7^ e, v 7^ e, and 5 7^ e, we have 61 7^ Ci by definition of e, so 61 7^ V\. 
Since xui 7^ yi^i, we have b\u 7^ uui. Thus, we can write x and y in the form 

(I)- 

If b 7^ e and exactly one of w, v is equal to e, then we have b\ 7^ c\ by 
definition of e. Letting z equal the word in {u,v} that is nonempty, we see 
that bi 7^ Z\ and 7^ as above. Renaming z as u, we can write x and y 
in the form (2). 

If u 7^ e, v 7^ e, and b — e, then we have Wi u i 7^ uui, since 2?m 7^ j/ui. Thus, 
we can write x and y in the form (3). 

If b = e and exactly one of it, v is equal to e, we can write x and y in the 
form (4) (by renaming the nonempty word in {u, v} as u). We note that p > 1 
in this case because y 7^ e. □ 

Theorem 1. Lei u> 6e any word in E*. Then, <p(w) = 2 iff (3{w) = 2. 

Proof. WLOG, we can assume that w is cyclically reduced, <f>(w) > 2, and 
> 2. Since > 2, no quotients in have characteristic 0, and only 
the universal graph T w has characteristic 1. Thus, one can easily verify from 

© that a 2 (w) = -(**«) + £* = i (If) + |{r G Q w : x(T) = 2}|. We will show 

that |{r G : x(r) = 2 and T has type A}| = (" r 2 -) - £* =1 (If)- It then 
follows that d2(V) = |{T G Q w : xCO = 2 and T has type B}|, so <f>(w) = 2 iff 
P(w) = 2. 

To show that \{T E Q w : x(T) = 2 and T has type A}| = (^»)-Ej=i (If) > 

we first note that ("a 1 ") ~~ Sj=i C^ 1 ") * s ec L ua l to the number of (connected) com- 
ponents of the identification graph I w , since /„, contains no cycles (Proposition 
HJ. It suffices to show that the number of components of I w is equal to |{T G 
Qw '■ x(r) = 2 and T has type A}|. Each component C of I w gives a quotient 
in Q w that has characteristic 2 and type A, namely rel(T w /V(C)), or equiv- 
alently rel(T w /{U, V}) for any {£/, V} G V(C7). (Each quotient re^/V^C)) 
cannot have characteristic less than 2 because if it did, it would have type B.) 
This defines a surjective function / from the components of I w to the quotients 
in Q w that have characteristic 2 and type A. It suffices to show that / is also 
inject ive. 

Firstly, we recall that the components of I w are paths (Proposition [2]). 
Thus, each component has a length (isolated vertices have length 0). Recall 
that we say component C implies component C if U =f(c) V for every pair 
{U, V} G C . We will prove the following claim: If a component C implies a 
different component C, then the length of C is strictly longer than the length 
of C This proves the main theorem, since if f(C) = /(C), then C and C 
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would imply one another, and if C ^ C, then C would be strictly longer than 
C, and C would be strictly longer than C; this is clearly a contradiction. 

We now prove the above claim. Let C be any component of I w , and let 
{Uo, Vq} and {Ui, V{\ be the two endpoints of C . The non-backtracking walk 
from {Uq,Vq} to {Ui,Vi} corresponds to a pair of distinct non-backtracking 
walks in T w . (These two walks are non-backtracking because I w does not 
contain a pair of distinct j-edges that have the same head vertex or the same 
tail vertex). WLOG, we can assume that the two walks are from Uq to U\ and 
from Vq to Vf, we denote the two walks by wu 0t Ui an d w\r ,v t , respectively. 

We claim we can assume that the two subgraphs traced out by the two 
walks do intersect. To see this, suppose that the two subgraphs don't inter- 
sect. Then, none of the vertices in C intersect (as sets) nontrivially with one 
another. Thus, C is closed under transitivity of pairs, so C does not imply 
any components other than itself, and we are done. (Recall that the set of 
components implied by C is the smallest set of components that contains C 
and is closed under transitivity of pairs.) 

Since w is nonempty and cyclically reduced, T w is a cycle. We now claim 
that the two walks must move in the same direction along the cycle. Suppose 
not. We first note that the two walks start at different vertices. Since the two 
subgraphs traced out by the two walks do intersect, the two walks must meet 
each other either by visiting the same vertex simultaneously, or by traversing 
the same edge simultaneously. By the construction of both walks simul- 
taneously traverse same-labeled edges with the same orientation (either both 
consistent with the edges' orientation, or both inconsistent). Thus, visiting 
the same vertex simultaneously would imply that two distinct j-edges have 
the same head or the same tail, contradicting the realizability of T w . Travers- 
ing the same edge simultaneously is clearly impossible. 

We recall that the set of components implied by C is contained in any set 
of components (of I w ) that contains C and is closed under transitivity of pairs. 
Thus, to complete the proof, it suffices to describe a set of components that 
is closed under transitivity of pairs and whose unique longest component is 
C. Our strategy is to recursively factor the word w into finer and finer pieces 
(using Lemma [2]), until certain conditions are met. At the end of the factoring 
process, we will have a factorization of w that allows us to describe a set of 
components that have the desired properties. 

We now describe how to obtain our initial factorization of w. Since I w 
contains no cycles, the two walks wu ,u t and wv ,v t cannot go around the cycle 
of and return back to their respective starting vertices. WLOG, we can 
assume that s G Uq and that Wu ,ui t races ou t a prefix of w\ i.e., the word w is 
traced out by the walk that starts at Uq, moves in the same direction as wu ,u n 
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Figure 1: Example of how T w may look like. The two walks wu ,Ui and 
start at vertices Uo and Vo, respectively, and move clockwise around the cycle 
for some distance. The word w is traced out by the walk from U to itself, 
going clockwise around the cycle once, x is the prefix of w traced out by the 
walk from Uq to V , and y is the remaining suffix of w. 



and goes once around the cycle back to U . This assumption is possible because 
if w' is a cyclic shift of w, or if w' = then T w and T w i are isomorphic as 
directed edge-labeled graphs; thus, there is a natural isomorphism between I w 
and I w i that preserves the implications of each component. 

Let x be the prefix of w traced out by the walk from Uq to Vo, and let y be 
the remaining suffix of w so that w = xy (see Figured]). We note that x ^ e 
and y ^ e. Also, since {Uo, Vo} is an endpoint of C, we have x\ x \ ^ y\ y \. Using 
Lemma Ej we factor w into the form where is one of the following 
expressions: 

(1) = where Pl > 1, bf ] ^ v{ 1] , and fcfj ^ v$ . 

(2) w« = (w«) Pl &« where Pl > 2, ^ u{\ and fcfj ^ u[*}. 

(3) u;W = ( u W«W) w uW where pi > 1 and ^ ufj. 

(4) uA 1 ) = (uW)" 1 where pi > 2. 

Above, uW,?;' 1 ', and are elements of S + . In (2) above, we have pi > 2 
because p > 1 in (2) of Lemma [21 since p = would imply that y = b, so 
yi 7^ xi, contradicting the fact that wu ,u t and uv ,Vi intersect. We note that 
w cannot be factored into the form (4) above, since w is not imprimitive (since 
4>(w) 7^ 1). Figure [2] shows how the factorization u>W of w may look like in 
the universal graph T w . 
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(1) 



(2) 



(3) 



Figure 2: Examples of how the factorization of w may look like in T w . 
The three graphs correspond to the factorizations (1) - (3) above. Note that 
factorization (4) is not possible because w is not imprimitive. 

We will later describe how to further factor w, if necessary. As we re- 
cursively factor w, the expressions for w become increasingly complicated, so 
we will not be writing them out explicitly. However, the first factorization of 
w has certain properties, and these properties are preserved throughout the 
factoring process. The stopping condition for the factoring process guaran- 
tees that the final factorization has certain additional properties. All of these 
properties combined together allow us to describe a set of components that is 
closed under transitivity of pairs and whose unique longest component is C. 

To help us describe such a set of components, we introduce the notion of a 
"path" in an explicit factorization of w. "Paths" in the final factorization of w 
will be used to describe a set of components that is closed under transitivity of 
pairs. To show that C is the longest component in this set of components, we 
will describe a property regarding "paths" that the factorization has, and 
we will show that this property is preserved throughout the factoring process. 
In the final factorization of w, we will be able to use this property to show 
that C is the longest component in the set of components. 

Given an explicit expression iyW representing a factorization of w, we define 
a path in to be a 2-tuple (n, z) where n is an integer specifying the position 
of the symbol the path starts at, and z is a finite subexpression and prefix of 
s(tyW)°° ((w^) 00 is the concatenation of with itself infinitely many times), 
where s is the subexpression and suffix of starting at the nth symbol. For 
example, if iyW — uvzxyzzy, then (2,vzx), (7,zyuvz), and (4, xyzzyuvzxy) 
are all paths in Note that in the definition of a path in the symbols 
in are regarded as formal symbols, so the actual words in S* represented 
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by the symbols do not matter. 

The two subexpressions for two paths in iyW are equivalent iff both subex- 
pressions contain the exact same sequence of formal symbols, in which case 
we say that the two paths are subexpression- equivalent. A path is also said to 
trace out the word (in £*) represented by the path's subexpression. A pair 
of subexpression-equivalent paths is said to be of maximal length if the two 
paths cannot be extended (in the forward direction, while keeping the starting 
points fixed); i.e., the symbol following the end of one path is not the same as 
the symbol following the end of the other path. 

Let m denote the position of the letter in w (with all of its letters written 
out) corresponding to the first letter of the second occurrence of in 
We note that the pair of walks in F w corresponding to C starts at the pair of 
vertices right before wi and w m in T w . m remains fixed throughout the rest 
of the proof. 

One can easily verify that satisfies properties (5) - (7) below (with 
% — 1; properties (5a) - (5c) correspond to (1) - (3) above): 

(5) it;W satisfies one of the following properties: 

(5a) it;W is some expression in u^\v^\b^ E S + (using all of these 
symbols and nothing else) such that bf* ^ vf* and b^ (i) , ^ v ui)r 

Also, no occurrence of in iyW is cyclically followed or preceded 
by (By "cyclically" , we mean that the symbol following the last 
symbol of is the first symbol of , and the symbol preceding 
the first symbol is the last symbol.) 

(5b) iyW is some expression in b^ E S + (using all of these symbols 
and nothing else) such that ^ and fej^o ^ u fyi)r 

(5c) iyW is some expression in u^\v^ E S + (using all of these symbols 
and nothing else) such that ^ v \%)\- 

(6) Any occurrence of or i n W W i s cyclically followed and preceded by 

(7) The first symbol of is u®. The letter w m corresponds to the first letter 

of a non-first occurrence of in The pair of distinct subexpression- 
equivalent paths in that traces out the longest word is the pair of 
paths that start at the first occurrence of and the occurrence of 
described above, respectively, and is of maximal length. Furthermore, 
this pair of paths is unique. 
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Now, we describe a process that generates a finite sequence w^ 1 ', . . . , 
of factorizations of w. We will show that each of the factorizations satisfies 
properties (5) - (7). We start with the expression which satisfies proper- 
ties (5) - (7). Let i > 1, and suppose that tyW satisfies properties (5) - (7). If 
u>w satisfies (5a) or (5b), we stop; otherwise, ti>w must satisfy (5c), i.e. it>M i s 
some expression in u^\v^ G S + such that uj^i)! 7^ u Si)r We w ^ show how 
to generate the next factorization -u;(* +1 ) from 

Using Lemma [2], we explicitly factor izW and t)W into one of the following 
forms: 

(8) = an d V M = (y*+iy 4 +i))p l+ i M (*+i) & ( 4 +i) wnere > , 

0\ T v \ 5 ana °|6(i+i)| T v \ v (i+i)r 



(9) «W = and v« = ( m (*+i))p*+i^+i) where p i+1 > 0, ^ 

ana 0| 6(i+1) | ^ M | M(l +i)|- 



(10) uW = and v« = («( i + 1 ) w ( i + 1 ))^+i u ( i + 1 ) where p i+x > and 

"| W (<+1)| 7= U \ v (i+1)\- 

(11) «W = and v {i) = (u( i+1) ) p * 1 where p i+1 > 1. 

Above, y( l+1 \ and are words in S + . We note that izW and 

cannot be factored into the form (11), since this would imply that w is 
imprimitive, which is a contradiction. To obtain the factorization w^ l+1 ^ of w, 
we substitute «w a nd t>w i n tyW by the appropriate expressions above. 

For example, if satisfies (5a) or (5b) (i.e., is equal to the ex- 
pression in (1) or (2) shown earlier), we stop; otherwise, must satisfy 
(5c), i.e., u>W i s equal to the expression in (3) shown earlier. Thus, w^ 1 ' = 
(u^v^) Pl u ( - 1 \ Depending on whether we have factored w 1 ' and into 

the form (8), (9), or (10), we have = ((u^v^) P2+1 u^b^) Pl u^v^ , 
w {2) = ^ u (2)y P2 +i b (2)y lu (2)^ OTW (2) = ^ u (2) v (2)y 2+ i u (2)y lu (2) v (2)^ respectively. 

Figure [3] shows how the factorization of w may look like in the universal 
graph F w . 

Claim 1. satisfies properties (5) - (7). 

Proof of ClaimUl By comparing the expressions in (8) - (10) to properties 
(5a) - (5c) respectively, we see that satisfies property (5). Using the fact 

that u;W satisfies properties (5c) and (6), one can easily verify (by considering 
cases (8) - (10) separately) that also satisfies property (6). 
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Since satisfies property (7) and we factored into some expression 
that starts with the first symbol of w^ +1 ^ is u^ +1 \ and the letter w m 

corresponds to the first letter of a non- first occurrence of in w^ l+1 \ Now, 

we shall determine which pairs of distinct subexpression-equivalent paths in 
■u/* +1 ) trace out the longest word, if such a pair exists. 

We note that such a pair must exist, since any pair of distinct subexpression- 
equivalent paths in w^ t+1 ^ cannot trace a word that is at least in length, 
since this would imply that I w contains a cycle, which is a contradiction. Now, 
consider any pair of distinct subexpression-equivalent paths in w^ +1 \ If the 
pair of paths starts with t>( l+1 ) or b^ l+1 \ then there is another pair of paths 
that trace out a longer word, since both t>( l+1 ) and are always cyclically 

preceded by vS l+1 \ Thus, we can assume that the pair of paths starts with 
u (i+i). 

Some of the u^+^'s in u>( l+1 ) correspond to the «W's in iyW in the sense 
that the u^ +1 ^s come from the expression we wrote for (as opposed to 
coming from the expression we wrote for v^). Such uW's are called old, while 
the other m^'s are called new. 

Consider the pair of u(' +1 )'s that the pair of paths start with. If the pair of 
initial u^+^'s are both new, then there is another pair of paths that trace out 
a longer word, since both u(' +1 )'s are cyclically preceded by the same symbol 
in w( l+1 \ (Consider cases (8) - (10) separately, and recall that is always 
cyclically preceded by in w^\) 

If one of the initial M ( i+1 )'s is old while the other is new, then one can 
easily verify that the word traced out by the pair of paths is at most \v^\ in 
length; let M be the smallest nonnegative integer that is > the length of the 
word traced out by any such pair of paths in w^ l+1 \ (For the case where we 
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have written and v W in the form (10), recall that is always cyclically 
followed by in One can easily verify that for case (8), we have 

M = |(u( < + 1 M i + 1 ))«+iu( i + 1 )|; for case (9), we have M = |( M (*+i))ft+i^+i)| ; an d 
for case (10), we have M = = \( u (i+i) v (i+i)y I+ i u (i+i)^ 

Now, suppose that both of the initial u^+^'s are old. Then, the pair of 
initial u(' +1 )'s corresponds to a pair of u^'s in Let A denote the set of 
pairs of distinct subexpression-equivalent paths in that both start with 
and are of maximal length. Let B denote the set of pairs of distinct 
subexpression-equivalent paths in w^ l+1 ^ that both start with old -u^ +1 ) and 
are of maximal length. We note that there is a natural bijection between A to 
B. Specifically, a pair of paths in B that both start with old u^ l+1 ' ) corresponds 
to the pair of paths in A that start at the corresponding pair of «W's in 

We claim that every pair of paths in B traces out a word that is exactly 
M letters (in E) longer than the corresponding pair of paths in A. To see this, 
consider any pair of paths in A; one of the paths must end right before a 
while the other path must end right before a (since is an expression 
in and and the pair is of maximal length). We note that the corre- 
sponding pair of paths in B would reach the expressions used to substitute 
this specific occurrence of and v ^. Then, looking at the expressions we 
wrote for and we see that the claim holds. 

Thus, a pair of paths in B that traces out the longest word (out of all the 
pairs of paths in B) corresponds to a pair of paths in A that traces out the 
longest word (out of all the pairs of paths in A). Since satisfies property 
(7), we now see that w^ 1 ^ also satisfies property (7). □ 

If we have factored and into the form (8) or (9), then -u/* +1 ) satisfies 
property (5a) or (5b), so we stop. Otherwise, we must have factored 
and into the form (10). In this case, we have max{|tt^ +1 ^|, \v < 
\u®\. Thus, the factoring process must eventually stop; we denote the final 
factorization of w by w^ N \ We note that satisfies properties (5) - (7), 
and furthermore, satisfies property (5a) or (5b). 

We now describe a set X of vertices in I w . X is carefully defined so that 
it induces a set of components of I w that is closed under transitivity of pairs 
and whose unique longest component is C. To define X, we will be using the 
final factorization of w, which has all the properties we need. The exact 
definition of X depends on whether satisfies property (5a) or (5b). 

Let A denote the set of pairs of distinct subexpression-equivalent paths in 
W ( N ) of maximal length. Each pair of paths in A corresponds to a pair of 
distinct walks in T w , say Y , Y ± , . . . , Y r and Z , Z ± , . . . , Z r , that trace out the 
same word; these two walks define a set of vertices in /„,, namely {{Y ,Z }, 
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{Yi, Zi}, {Y r ,Z r }}. If w( N > satisfies property (5a), we let X equal the 
union of all the sets of vertices obtained from pairs of paths in A that start 
with u^ N \ If satisfies property (5b), we let X equal the union of all the 
sets of vertices obtained from pairs of paths in A. 

Claim 2. The vertices in X induce a set of components of I w that contains 
C and is closed under transitivity of pairs. Also, C is the (unique) longest 
component induced by X . 

Proof of Claim® We first suppose that w^ N > satisfies property (5a). Consider 
any pair of paths in A that start with vS N '. We claim that one of the paths 
ends right before a while the other path ends right before a v^ N \ This 
can be seen by noting that the pair of paths start with u^ N \ no occurrence 
of u {N) is cyclically followed by u^ N \ and any occurrence of v^ N ' or 

b (N) 

IS 

always cyclically followed by u^ N \ Since ^ v[ N ' (property (5a)), we see 
that the pair of walks in r„, corresponding to the pair of paths end at a pair 
of vertices that has degree 1 in I w . Since b^l^ ^ t),^,, and no occurrence 

of U W is cyclically preceded by (property (5a)), if the pair of walks in 
r„ start at a pair of vertices that has degree 2 in I w , then the pair of initial 
u (N)i s j s C y C li ca lly preceded by a pair offer's or a pair of u^'s, both of which 
are cyclically preceded by a pair of w^'s (property (6)). This shows that X 
induces a set of components of I w ; furthermore, for every component induced 
by X, there exists a pair of paths in A starting with u'^ that produces exactly 
the vertices of the component. The case where to^ satisfies property (5b) can 
be handled similarly. 

We now show that X is closed under transitivity of pairs. We first suppose 
that satisfies property (5a). Let {U, V} and {V, W} be distinct vertices in 
X. Let D (respectively, D') be the component containing {U, V} (respectively, 
{V, W}). Each component induced by X corresponds to some pair of paths in 
A that start with u^ N \ Thus, each component has a starting vertex and an 
ending vertex. 

Suppose that {U, V} is the starting vertex of D. Then, we note that {U, V} 
corresponds to a pair of locations in w^ N \ each of which is cyclically in between 
two symbols (as opposed to being within a symbol). Since D corresponds to 
some pair of paths in A that start with vS N \ the pair of symbols cyclically 
after (the pair of locations of) {U, V} in w^ N ' are both u^ N \ Since D' contains 
{V, W} and corresponds to some pair of paths in A, and since V corresponds 
to some location in w^ N ' that is cyclically in between two symbols, W also 
corresponds to some location in w^ N ' that is cyclically in between two symbols. 

We claim that the symbol cyclically after (the location of) W in w^ N ' is 
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also a u( N >. To see this, we note that the pair of paths corresponding to D' 
is in A and starts with u^ N \ so the pair of paths must end right before a b( N > 
and a (shown earlier in this proof of this claim). Thus, the pair of paths 
cannot end at the pair of symbols cyclically before {V, W} since the symbol 
cyclically after V is a u^ N \ so the pair of paths must go past this pair of 
symbols. Thus, the symbol cyclically after W matches the symbol cyclically 
after V . Thus, the symbol cyclically after W in w^ N ' is also a u^ N \ so {U, W} 
is in X. A similar argument shows that if {V, W} is the starting vertex of D', 
then {U, W} is in X. 

Now, suppose that both {U, V} and {V, W} are not starting vertices. Also, 
suppose that for every vertex {U', V'} in D that is between the starting vertex 
and the vertex preceding {U, V} (inclusive), if {U', V'} (nontrivially) intersects 
any other vertex in X, then their symmetric difference is still in X. Then, the 
symmetric difference of the pair of vertices preceding {U, V} and {V", W} is in 
X, and we note that this vertex is a neighbor of {U, W}. Thus, {U, W} is in 
X. The case where w^ N ' satisfies property (5b) can be handled similarly. 

Since satisfies property (7), C is the longest component induced by 
X. □ 

This completes the proof of the theorem. □ 



5 Discussion of the 4>(w) = (3(w) Conjecture 

The main conjecture we want to show is that 4>{w) = f3(w) for every word 
w6E*. We can rewrite this conjecture in the following form: 

Conjecture 1. For every word w G S* and every non-negative integer i, we 
have 4>{w) = i iff (3{w) = i. 

We note that Conjectured] automatically implies that (f)(w) = oo iff (3{w) = 
oo. In |LP09j . Linial and Puder implicitly make the following conjecture: 

Conjecture 2. For every word w G X* and every non-negative integer i, if 
(3{w) > i, then 

a (w) = ■ ■ ■ = aj_i(w) = and 
ai(w) = \{T G Q w : x(T) = i and T has type B}\. 

It is easy to see that if Conjecture [2] is true, then for every word w G £*, 
we have 4>{w) = (3(w), and if (3{w) < oo, we have 

|{r G Q w : x(r) = 0(w) and T has type B}| / 1 \ 
$ -^ = + (^MTi and 
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r has type A F has type B 

Recall that we have 

1 00 1 

*„(*) = — + £ P(r)+ £ nr) = £a,H-. 

reQuj reQ m i=o 

r has type A T has type B 

Conjecture [2] states that if Q w does not have any type-B quotients with char- 
acteristic less than i, then the contributions from the — - term and the type-A 
quotient terms (in § w (n) above) to the coefficients ao(w) , . . . , ai(w) cancel 
each other exactly so that ao(w) = ■ ■ ■ = aj_i(iu) = and ai{w) = \{T G Q w : 
x(r) = i and V has type B}|. In order to explain this cancellation, it may 
be tempting to conjecture that ^ reQ w P(T) = \ for every word w G S* 

r has type A 

so that & w (n) = ^ res m P(T) and Conjecture [2] holds. Or, it may be 

r has type B 

tempting to conjecture that ^ reQ w Pfi) = - for every word w G S* so 

r has type A 
x(T)<» 

that $ w (n) = £ reQ, P(r) reQ, P(r) and Conjecture holds. 

r has type A T has type B 

X(T)X 

Neither of these conjectures are true; u> = gigig^g-i and u> = g\g\g\g\ are 
counterexamples showing that these two conjectures are false, respectively. 

We know that (3(w), and the coefficients a (w), ai(w), ... in the power 
series of Q w are all invariant under cyclic shift and reduction of w. Further- 
more, it is implicitly shown in |LP09] that \{T G Q w : x(T) = (3{w) and Y has 
type B}| is also invariant under cyclic shift and reduction of w. Thus, when 
attempting to prove Conjecture [2, one can assume that the word w G S* is 
cyclically reduced. 

Recall that we have 

^>^£(r ) -nH)inK4 + - 

V£Q W v 7 1=1 v 7 j=l 1=1 v 

We can easily write explicit formulas for the first few coefficients of the 
power series Y^=o a i( w )^j ^ or ® ne can easu y verify the following: 



a (w) 



if has type A 

1 if r„, has type B 



23 



a,i(w) 



\{T G Q w : x(r) = 1 and F has type B}| if r„, has type A 
■1 if T w has type B 



If T w has type B (i.e., w reduces to the empty word), then $ w (n) = 1 — ~, so 
a 2 (u>) = a 3 (w) = ■ ■ ■ = 0. Thus, assuming that T w has type A, one can easily 
verify the following: 



a 2 {w) 



E 

rgQ„ 
x(r)=i 




/v r -l «r-l «r-l ft e r _1 fc e f _1 e f _1 fc "^r^ 1 

«.(») = E EE^-EEE i, + EEE «'+EE<' 

TgQu, \ i=l j=i+l i=l j=l J=l 3=1 i=l J'=I+1 3=1 J=l 

x(r)=i v 

k k eL-lel-l \ 



+E E EE"' - E 

3=1 3'=3+l '=1 !'=1 / TGQ^ 

/ x(r)=2 




3=1 



|{re Q w : X (r) = 3}| 



There may be other ways of writing the contribution from characteristic 1 
quotients to a 3 (w) that are more useful and meaningful. From the formulas 
for a (w) and ai(w), we see that Conjecture [2] holds for i — 0, 1. In our proof 
of 4>(w) = 2 iff (3(w) = 2, we have actually shown that Conjecture [2] holds for 
i = 2. It is easy to show that for every non- negative integer j, if Conjecture 
[1] holds for every i < j and Conjecture [2] holds for i — j, then Conjectured] 
also holds for i = j. Using this fact, it is also easy to show (by induction on 
j) that for every non-negative integer j, if Conjecture [2] holds for every i < j, 
then Conjecture [1] also holds for every i < j. 

We, as well as Linial and Puder, believe that Conjecture [2] is true, and 
this conjecture implies that <p(w) = /3(w) for every word w G S*. We now 
know that Conjecture [2] is true for i — 0, 1, 2, and Linial and Puder have 
carried out extensive numerical simulations to test Conjecture [2] for i — 3,4 
without any failure (see (LP09J). We believe that a good strategy for proving 
the 4>(w) = /3(w) conjecture is to actually prove Conjecture [2j 

Recall that in the proof of 4>{w) = 2 iff j3{w) = 2, we showed there exists 
a bijection between the components of I w and the quotients in Q w that have 
characteristic 2 and type A. Now, fix w G S*. From here on, we will assume 
that w is cyclically reduced, <f>(w) > 3, and (3(w) > 3. Since (3{w) > 3, 
we note that T w is the only quotient (in Q w ) of characteristic 1, and every 
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quotient of characteristic 2 has type A. We note that every type-A quotient 
of characteristic 3 can be obtained by joining two vertices of a quotient of 
characteristic 2 (and then performing a realization operation). I.e., for every 
type-A quotient T G Q w of characteristic 3, there exists a quotient V G Q w of 
characteristic 2 and a component C of Ir> such that rel(T' /V(C)) = T. 

Looking at our formula for a 3 (u>), we note that E reQ w ( ( ? 2 r ) — EjLi ( 6 J ) ) 

x(r)=2 V / 

— E res™ ((# of components of Jr) — (# of independent cycles in Jr)). It is 

X(T)=2 

not hard to show that for every quotient T G Q w of characteristic 2, every com- 
ponent of Jr contains at most 1 cycle each. Thus, E reg m ( (T) — E*=i ) ) 

X(T)=2 V / 

— Eres„ (# of acyclic components of Jr). Using the identification graph (of 

X(T)=2 

a quotient), it is not hard to show that for every quotient T G Q«, of character- 
istic i, if C is a component of Jr that implies a component (possibly itself) con- 
taining a cycle, then reZ(r/V(C)) has characteristic at most i. (This is actually 
true for any word w G £*.) Thus, for every type-A quotient Y of characteristic 
3, there exists a quotient T' of characteristic 2 and an acyclic component C of 

Jiv such that rel(T'/V(C)) = T. Thus, ~Zre Qw ({^) - E?=i (!)) + l{r e 

x(r)=2 V / 

Q w : x (r) = 3 and r has type A}| < 0. 

It may be true that for every quotient r G Q w of characteristic 2, if C 
is a component of 7p that does not imply a component containing a cycle, 
then reZ(r/V(C)) has characteristic 3 (and thus type A as well). Assuming 
this is true, we have a surjective function from {C : C is a component of I? 
that does not imply a component containing a cycle, where T is any quotient 
in Q w of characteristic 2} to {T G <2 W : x(r) = 3 and T has type A}. This 
function is normally not injective, but it may be the case that this function is 
injective when restricted to {C : C is a component of Jr that does not imply 
a component containing a cycle} for any single quotient V of characteristic 
2. It may also be true that for every quotient Y G Q w of characteristic 2, a 
component C of Jr does not imply a component containing a cycle iff C itself 
does not contain a cycle; this would simplify the above statements a bit. 

In any case, we suspect that the contribution of T w to a%(w) is always non- 
negative and exactly cancels the contribution of the quotients of characteristic 
2 and the type-A quotients of characteristic 3. I.e, the contribution of Y w to 

a 3 (w) is Er 6S _, (f 2 r ) - E-=i (f)) " |{F G C : x(H = 3 and T has type 

A} |, so that a 3 (w) = \{F G : x(T) = 3 and T has type B}|, as desired. 
The contribution of T w to a 3 (w) seems to be compensating for the "lack of 
injectivity" of the above function. 
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It would be good to be able to express the contribution from quotients 
of characteristic 1 to 03 (w) in a more useful and meaningful form. It would 
also be good to be able to formulate Conjecture [2] for % = 3 (and higher) as 
a simpler and more accessible combinatorial problem, like what was done for 
i = 2. 
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